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Research  Objective 

The  primary  goal  of  this  research  was  to  investigate  DSP  hardware  techniques  to  support 
the  sharing  of  transmit  hardware  and  array  apertures  between  multiple  simultaneous  users 
of  Navy  RF  Systems,  specifically  with  respect  to  the  implementation  of  delta-sigma 
converter  based  DSP  systems.  In  particular,  we  considered  (1)  tradeoffs  between  speed, 
space  and  power  (2)  finite  precision  analysis  techniques  to  effect  efficient  hardware 
designs,  and  (3)  algorithm  and  hardware  considerations  for  critical  path  minimization. 
Specific  investigations  of  candidate  DSP  architectures,  including  implementations  in 
Xilinx  FPGAs,  allowed  us  to  evaluate  the  performance  of  several  alternate  approaches. 

Reduced  Scope 

The  IRST/Radar  program  under  which  this  work  was  funded  was  discontinued.  No 
FY2009  funds  were  provided  to  support  the  grant.  Consequently,  the  scope  of  our  work 
was  reduced.  The  original  grant  was  for  $300,000  over  3  years.  The  current  funding  level 
is  $189,  274.  We  were  compelled  to  eliminate  all  third  year  tasks  from  the  original 
proposal  due  to  lack  of  funding. 

Task  1:  Develop  and  analyze  candidate  system  architectures  that  balance 
speed,  space,  and  power 

This  task  was  expected  to  require  approximately  30%  of  the  effort  for  the  proposed 
project  over  the  original  three-year  period.  Consequently  the  candidate  system 
architectures  considered  were  significantly  limited. 

Task  2:  Apply  finite  precision  analysis  techniques  to  the  delta-sigma 
system 

This  task  was  expected  to  require  approximately  30%  of  the  effort  for  the  proposed 
project  over  the  original  three  year  period.  The  principal  results  from  this  work  were 
obtained  in  the  first  year  of  the  proposed  project,  but  we  had  limited  time  to  apply  these 
results  to  the  delta-sigma  system. 

Task  3:  Use  algorithm  and  hardware  innovations  to  minimize  the  critical 
path  in  the  delta-sigma  system 

This  task  was  expected  to  require  approximately  40%  of  the  effort  for  the  proposed 
project.  The  task  was  originally  planned  to  be  addressed  in  the  second  and  third  years  of 
the  project.  This  task  was  significantly  curtailed  due  to  lack  of  funding. 

Background  and  Motivation 

Sharing  of  transmit  hardware  and  array  apertures  between  multiple  simultaneous  users  of 
future  Navy  RF  systems  would  in  principle  permit  tremendous  system  flexibility,  but 
inter-modulation  distortion  (IMD)  products  created  in  transmitter  amplifiers  have  long 
been  assumed  to  preclude  this  possibility.  Such  spectral-purity  issues  are  not  confined  to 
future  systems:  in  key  areas  of  the  world  today  some  fleet  radars  are  not  allowed  to 
radiate;  they  cannot  be  used  at  all.  The  best  long-term  prospect  for  robustly  solving  both 
problems  lies  in  replacing  the  high  power  amplifier  (HPA)  in  the  Transmit-Receive 
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module  with  direct  digital  waveform  synthesis  at  higher  power  levels,  thus  removing  the 
need  for  that  amplifier.  The  only  synthesis  approach  on  the  horizon  that  can  offer  suitably 
low  IMD  and  spurious  levels  appears  to  be  noise-shaping  D/A  conversion,  in  which  a 
digital  delta-sigma  modulator  drives  an  RF  power  circuit  realizing  linear  pulse 
modulation.  Unfortunately,  real-time  delta-sigma  modulators  do  not  exist  at  the  required 
microwave  sample  rates,  nor  are  they  likely  to  be  forthcoming  from  industry. 

Conventional  Nyquist-rate  D/A  converters,  which  attempt  to  spread  quantization  noise 
evenly  across  the  frequency  band,  are  generally  limited  by  nonlinearities  in  the  quantizer 
to  about  14-16  bits  of  resolution  without  the  use  of  expensive  and  tedious  trimming  and 
matching  techniques  [0-3].  Noise-shaped  conversion  [4-6]  uses  a  much  higher  data  rate 
than  the  bandwidth  of  the  signal  to  be  converted,  along  with  a  low-resolution  (often  one 
bit)  quantizer  to  achieve  high  in-band  linearity.  This  is  accomplished  by  embedding  the 
quantizer  in  a  feedback  loop  designed  to  spectrally  shape  the  quantization  noise  and 
effectively  push  that  noise  out  of  the  signal  frequency  band.  The  highly  quantized  output 
is  then  converted  to  analog  via  linear  pulse  modulation,  and  the  out-of-band  quantization 
noise  is  removed  with  a  bandpass  filter.  The  price  of  this  high  resolution  is  not  just  the 
higher  data  rate  (which  is  partially  offset  by  the  simpler  quantizer),  but  also  the  risk  of 
somewhat  unpredictable  instabilities  not  present  in  conventional  converters.  The 
hardware  clock  rates  required  can  be  reduced  using  parallelization  techniques  developed 
at  NRL[7-8]  and  by  using  multilevel  quantizer  implementations^].  The  latter  also 
improves  stability  but  leads  to  inevitable  hardware-mismatch  errors.  Techniques 
developed  at  NRL  [5,6,8]  and  elsewhere  [10-14]  use  a  dynamic  quantizer  implementation 
to  spectrally  shape  mismatch  errors  out  of  the  signal  band  much  in  the  way  that  the  delta- 
sigma  modulator  shapes  quantization  errors.  Many  aspects  of  the  structure  of  parallel  and 
multilevel  quantizer  architectures  are  also  incorporated  in  the  basic  delta-sigma 
modulator,  and  so  implementation  techniques  applicable  to  one  generally  apply  to  all. 

To  implement  such  a  system  using  FPGA  (or  ASIC)  hardware  approaches  requires  a 
thorough  understanding  of  the  algorithms  involved,  as  well  as  the  use  of  innovative 
computer  arithmetic  and  digital  hardware  design  techniques.  We  have  a  great  deal  of 
experience  in  the  hardware  implementation  of  DSP  systems,  especially  concerning  the 
inevitable  tradeoff  of  speed,  space,  and  power.  We  have  previously  worked  with  the  NRL 
on  variable  bit-precision  implementation  of  digital  filters  [15-16],  which  is  also 
applicable  here. 

Approach 

Even  though  we  had  a  significant  reduction  in  funding,  we  made  significant  progress  on 
the  interrelated  tasks  that  comprise  this  work: 

Task  1:  Develop  and  analyze  candidate  system  architectures  that  balance  speed,  space, 
and  power 

In  complex  VLSI  designs,  including  FPGAs  and  ASICs,  there  is  always  a  fundamental 
balance  between  the  maximum  speed  at  which  the  circuit  can  be  operated,  the  amount  of 
chip  area  consumed  by  the  circuit,  and  the  power  consumed  by  the  operating  circuit.  In 
the  proposed  delta-sigma  implementation,  the  circuit  area  will  be  dominated  by  the  loop 
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filter  [5,7],  Our  previously  developed  approaches,  e.g.  [17,20],  as  well  as  those  of  other 
researchers,  was  applied  to  this  particular  problem.  A  naive  implementation  approach 
was  explored  to  estimate  hardware  requirements  for  the  system.  The  nature  of  the  delta- 
sigma  system  requires  novel  applications  of  known  techniques  since  the  filter  is  itself 
embedded  in  a  feedback  loop  which  poses  some  additional  challenges  to  the  analysis  and 
design.  We  have  developed  VHDL  designs  for  the  loop  filter  implementation  based  on 
different  candidate  architectures,  as  well  as  a  VHDL  design  of  a  simple  delta-sigma 
converter  that  will  be  the  basis  of  integrating  our  designs  in  this  project.  We  also  explored 
new  designs,  particularly  for  high  performance  implementations  of  the  loop  filter  based 
on  multiple  constant  multiplication  (MCM). 

Task  2:  Apply  finite  precision  analysis  techniques  to  the  delta-sigma  system 
Theoretical  analysis  of  digital  signal  processing  algorithms,  including  delta-sigma 
approaches,  routinely  uses  double-precision  floating  point  number  representations  for 
simplicity.  Although  some  Matlab™  tools  do  include  limited  capability  for  considering 
finite  precision  effects,  detailed  analysis  requires  custom  simulations.  This  high  level  of 
analysis  is  required  to  understand  the  finite  precision  effects  so  that  they  can  be  used  to 
create  better  implementations.  Specifically,  our  previous  research  [Error!  Reference 
source  not  found.,  19,20,21]  has  shown  that  using  variable  levels  of  precision  can  result 
in  a  better  tradeoff  between  performance  and  space  -  not  all  bits  have  equal  importance 
to  the  design.  We  can  use  our  analysis  to  develop  design  techniques  that  create  more 
practically  realizable  implementations.  Finite  precision  effects  were  considered  for  new 
hardware  implementations.  We  considered  the  effects  of  finite  precision  effects  in  terms 
of  the  coefficient  quantization  and  the  internal  rounding  for  MCM  implementations  and 
designed  approaches  for  implementing  significantly  smaller  circuits  through  the  use  of 
carefully  designed  quantization  algorithms. 

Task  3:  Use  algorithm  and  hardware  innovations  to  minimize  the  critical  path  in  the 
delta-sigma  system 

The  nature  of  the  delta-sigma  problem  yields  a  unique  opportunity  to  manage  the  critical 
path  for  maximum  throughput.  The  feedback  loop  complicates  the  analysis  and 
implementation  of  the  delta-sigma  system.  Although  the  critical  path  does  include  the 
feedback  loop,  the  inherent  timing  constraints  in  the  delta-sigma  converter  still  allows  us 
to  implement  the  FIR  loop  filter  using  high-throughput  pipeline  implementation 
techniques.  Thus,  we  can  calculate  some  results  quite  early,  which  can  be  used  to 
effectively  reduce  the  critical  path. 

Long  Term  Proposed  Research 

We  have  begun  preliminary  investigations  into  the  design  of  sparse  Filters  using  1-norm 
criteria.  Using  results  from  Donoho  used  in  denoising,  we  should  also  be  able  to  develop 
efficient  techniques  for  designing  sparse  filters  using  entropy-based  criteria.  One  possibility 
is  through  direct  minimization  of  entropy  instead  of  energy.  Probably  more  fruitful  would 
be  approaches  based  on  the  Hirschman  Uncertainty,  such  as  an  approach  that  uses  the 
Hirschman  Optimal  Transform  (HOT)  developed  by  V.  DeBrunner  along  with  T.  Przebinda 
and  M.  Ozaydin[22].  Further  improvements  can  be  achieved  by  combining  these  approaches 
with  design  optimizations  for  sparseness,  such  as  that  proposed  by  0.  Gustafsson,  L. 
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DeBrunner,  V.  DeBrunner,  and  H.  Johansson[23].  By  combining  these  sparse  designs  with 
multiple  constant  multipliers  (MCM)  and  computer  arithmetic  techniques  based  on  number 
representation,  hardware  implementations  of  filters  for  a  wide  range  of  high  performance 
applications  are  possible.  The  developed  designs  would  be  of  great  use  in  fixed 
implementations  of  fast  mixed  signal  and  DSP  systems,  but  also  in  the  application  of 
adaptive  filters  in  communications  and  electronic  sensing  (both  passive  and  active). 

Summary  of  Key  Results 

In  the  first  year  of  the  project,  we  developed  a  Multiple  Constant  Multiplier  (MCM) 
design  that  can  operate  at  approximately  320  MHz  and  uses  less  area  that  more  traditional 
design  approaches.  Analysis  of  coefficient  quantization  effects  indicates  that  a  large 
number  of  bits  are  required  for  coefficients  to  achieve  acceptable  performance. 

In  the  second  year  of  the  project  we  focused  on  the  implementation  of  the  delta-sigma 
loop  filter.  We  have  studied  the  impact  of  sparse  coefficients  and  the  use  of  multiple 
constant  multiplier  (MCM)  techniques  to  reduce  the  hardware  requirements  of  the  filter. 
In  addition,  we  have  investigated  the  effects  that  embedded  hardware  blocks,  such  as  the 
Xilinx  DSP48  slice,  have  on  FPGA  filter  designs  optimized  using  MCM  techniques. 

Related  Publications 

Three  Masters  in  Electrical  &  Computer  Engineering  completed  work  directly  related  to 
this  research  project: 

•  Abhijit  Patil,  FPGA  Implementation  of  Digital  Filters  Using  MCM,  MS  EE  thesis,  Florida 
State  University,  2009. 

•  Sean  Patronis,  Sparse  FIR  Filters  and  The  Impact  on  FPGA  Area  Usage,  MS  EE  thesis, 
Florida  State  University,  2008. 

•  Charles  Howard,  Minimizing  FIR  Filter  Designs  Implemented  in  FPGAs  Utilizing 
Minimized  Adder  Graph  Techniques,  MS  EE  thesis,  Florida  State  University,  2008. 

The  accomplishments  of  this  research  and  related  work  are  demonstrated  in  several 
international  publications.  These  publications  give  additional  details  of  the  completed 
work. 

•  Rui  Guo,  Linda  S.  DeBrunner,  and  Kenny  Johansson,  "Truncated  MCM  Using  Pattern 
Modification  for  FIR  Filter  Implementation,”  Proc.  IEEE  International  Symposium  on 
Circuits  and  Systems  (ISCAS),  Paris,  France,  2010. 

•  K.  Johansson,  0.  Gustafsson,  L.  S.  DeBrunner,  "Estimation  of  the  switching  activity  in 
shift-and-add  based  computations,”  Proc.  IEEE  International  Symposium  on  Circuits 
and  Systems  (ISCAS),  Taipei,  Taiwan,  pp.  3055-3057,  2009. 

•  Rui  Guo,  Lei  Wang,  and  Linda  DeBrunner,  "A  Novel  FIR  Filter  Implementation  Using 
T runcated  MCM  T echnique,”  Proc.  Asilomar  Conference  on  Signals,  Systems,  and 
Computers,  Pacific  Grove,  California,  2009. 

•  Kenny  Johansson,  Linda  DeBrunner,  Oscar  Gustafsson,  and  Victor  DeBrunner, 
"Design  of  Multiplierless  FIR  Filters  with  an  Adder  Depth  Versus  Filter  Order  T rade- 
Off,”  Proc.  Asilomar  Conference  on  Signals,  Systems,  and  Computers.  Pacific  Grove, 
California,  2009. 
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•  Charles  D.  Howard  and  Linda  S.  DeBrunner,  "High  speed  DSP  block  for  FPGA  devices 
using  a  programmable  adder  graph,”  Proc.  IEEE  13th  Digital  Signal  Processing 
Workshop  and  5th  IEEE  Signal  Processing  Education  Workshop,  DSP/SPE  2009,  Marco 
Island,  Florida,  pp.  490-494,  2009. 

•  Sean  Patronis  and  Linda  DeBrunner,  "Sparse  FIR  Filters  and  their  Impact  on  FPGA 
Area  Usage,"  Proc.  Asilomar  Conference  on  Signals,  Systems,  and  Computers,  Pacific 
Grove  California,  pp.  1862-  1866,  2008. 

•  Charles  D.  Howard,  Linda  S.  DeBrunner,  and  Victor  DeBrunner,  "Hybrid  Multiple 
Constant  Multiplication  Implementation  for  FIR  Filters  in  FPGA  Devices,"  Proc. 
Asilomar  Conference  on  Signals,  Systems,  and  Computers,  Pacific  Grove,  California,  pp. 
1422-1425,2007. 

•  Linda  S.  DeBrunner,  "Reducing  Complexity  of  FIR  Filter  Implementations  for  Low 
Power  Applications,"  Proc.  Asilomar  Conference  on  Signals,  Systems,  and  Computers, 
Pacific  Grove,  California,  pp.  1407-141 1,  2007. 

•  Oscar  Gustafsson,  Linda  S.  DeBrunner,  Victor  DeBrunner,  and  H3kan  Johansson,  "On 
the  Design  of  Sparse  Half-Band  Like  FIR  Filters,"  Proc.  Asilomar  Conference  on 
Signals,  Systems,  and  Computers,  Pacific  Grove,  California,  pp.  1098-1102,  2007. 
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